CVSgrab: Mining the History of Large Software Projects
نویسندگان
چکیده
Many software projects use Software Configuration Management systems to support their development process. Such systems accumulate in time large amounts of information useful for process accounting and auditing. We study how software developers can get insight in this information in order to understand the project context and the product artifacts. To this end, we propose several new techniques for visual mining of project evolution. Central to our approach is a file-based evolution visualization, where each project is shown as a set of horizontal stripes depicting files along the time axis. We propose several mechanisms for interactively building layouts in this display, and for correlating the evolution with the results of various software metrics. We demonstrate the usefulness of our approach on reallife data sets.
منابع مشابه
Mining Software Repositories for Evaluating Software Engineering Properties of Language Designs
Improved separation of concern is important for dealing with increasing complexity of today’s software systems. A number of language designs have been proposed in the last decade with the common goal to improve the separation of concerns by providing better modularization mechanisms e.g. mix-ins, units, roles, layers, hyperspaces, events, aspects, etc. To understand the benefits of a new modula...
متن کاملAutomatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining
Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...
متن کاملApplication of Data Mining Techniques on Software Engineering Data for Software Quality
The processes of Software engineering are complex and produces large number and variety of artifacts. The potential of data mining technique on this large valuable data is to better manage the software projects and to produce high-quality software systems that are delivered on time and within budget. This paper present the latest research in mining software engineering data, software engineerin...
متن کاملDeclarative Visitors to Ease Fine-grained Source Code Mining with Full History on Billions of AST Nodes by Robert Dyer, Hridesh Rajan, and Tien N. Nguyen
Software repositories contain a vast wealth of information about software development. Mining these repositories has proven useful for detecting patterns in software development, testing hypotheses for new software engineering approaches, etc. Specifically, mining source code has yielded significant insights into software development artifacts and processes. Unfortunately, mining source code at...
متن کاملSoftware Metrics in Boa Large-Scale Software Mining Infrastructure: Challenges and Solutions
In this paper, we describe our experience implementing some of classic software engineering metrics using Boa—a large-scale software repository mining platform—and its dedicated language. We also aim to take an advantage of the Boa infrastructure to propose new software metrics and to characterize open source projects by software metrics to provide reference values of software metrics based on ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006